Coping with Derivation in the Bulgarian Wordnet
نویسندگان
چکیده
The paper motivates a strategy for identification and annotation of derivational relations in the Bulgarian wordnet that aims at coping with the complex morphology of the language in an elegant way. Our method involves transfer of the Princeton WordNet (morpho)semantic relations into the Bulgarian wordnet, at the level of the synset, and further detection of derivational relations between literals in Bulgarian. Derivational relations have been annotated to reflect the complexity of Bulgarian morphology. Introduced literal relations improve the consistency and employability of the
منابع مشابه
Towards a Complex Model for Morpho-Syntactic Annotation
We present the first results in the development of a multilingual resource that should enable the exploration of the possibility to apply various different lexical bases, such as edictionaries in LADL format and multilingual lexical databases like Wordnet and Prolex that were developed during the last decade. We discuss the problems of morphosyntactic annotation of the phenomena of regular deri...
متن کاملChallenges Behind the Data-driven Bulgarian WordNet (BulTreeBank Bulgarian Wordnet)
The paper presents our work towards the simultaneous creation of a data-driven WordNet for Bulgarian and a manually annotated treebank with semantic information. Such an approach requires synchronization of the word senses in both syntactic and lexical resources, without limiting the WordNet senses to the corpus or vice versa. Our strategy focuses on the identification of senses used in BulTree...
متن کاملClassification of Adjectives in BulNet: Notes on an Effort
The paper presents an overview of an attempt at the semantic classification of adjectives in the Bulgarian Wordnet based on the information that is already available in WordNet, and other classifications proposed in the literature (classifications in the linguistic literature for Bulgarian and approaches implemented by other wordnets, more precisely, the Wordnet for German). The proposed approa...
متن کاملConstructing of an Ontology-based Lexicon for Bulgarian
In this paper we report on the progress in the creation of an Ontology-based lexicon for Bulgarian. We have started with the concept set from an upper ontology (DOLCE). Then it was extended with concepts selected from the OntoWordNet, which correspond to Core WordNet and EuroWordNet Basic concepts. The underlying idea behind the ontology-based lexicon is its organization via two semantic relati...
متن کاملHydra: A Software System for Wordnet
This paper presents an overview of the software for wordnet processing Hydra. The system has fully-fledged GUI and API, both working with powerful modal query language. Hydra has been used for the development of the Bulgarian WordNet for the last 7 years and recently was improved, became open source and is distributed as part of the Meta-Share platform.
متن کامل